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5 FIELD OF PnTVTNTION 

This invention relates to the expression and secretion of recombinant fusion 
proteins from Caulobacter wherein a heterologous polypeptide is fused with all or part 
of the surface layer protein (S-iayer protein) of the bacterium. 

10 

BACKGROLTST) OF THE INVENTION 

Many bacteria assemble layers composed of repetitive, regularly aligned, 
proteinaceous sub-units on the outer surface of the cell. These layers are essentially 

15 two-dimensional paracrystalline arrays, and being the outer molecular layer of the 
organism, directly interface with the environment. In Caulobacter , the S-layer protein 
is synthesized by the cell in large quantities and the S-layer completely envelops the cell 
and thus appears to be a protective layer. 

Caulobacter are natural inhabitants of most soil and freshwater environments 

2 0 and may persist in waste water treatment systems and effluents. The bacteria alternate 
between a stalked cell that is attached to a surface, and an adhesive motile dispersal cell 
that searches to find a new surface upon which to stick and convert to a stalked cell. 
The bacteria attach tenaciously to nearly all surfaces and do so without producing the 
extracellular enzymes or polysaccharide "slimes" that are characteristic of most other 

2 5 surface attached bacteria. Caulobacters have simple requirements for growth. The 

organism is ubiquitous in the environment and has been isolated from oligotrophic to 
mesotrophic situations. They are known for their ability to tolerate low nutrient level 
stresses, for example, low phosphate levels. 

All of the freshwater Caulobacter that produce an S-layer are similar and have 

3 0 S- layers that are substantially the same under election microscopy. The layers are 

hexagonally arranged in all cases, with a similar centre-centre dimension (see: Walker. 
S.G., et al... (1992). "Isolation and Comparison of the Paracrystalline Surface Layer 
Proteins of Freshwater Caulobacters" J. Bacteriol. 174: 1783-1792). 
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16S rRNA sequence analysis of several S-layer producing Caulobacter strains 
show that they group ciosely (see: Stahl, D.A. er al. (1992) "The Phylogeny of Marine 
and Freshwater Caulobacters Reflects Their Habitat" J. Bacteriol. 174: 2193-2198). 
DNA probing of Southern blots using the S-layer gene from C. crescentus CB15 
5 identifies a single band that is consistent with the presence of a cognate gene (see: 
MacRae, J.D. and, J. Smit. (1991) "Characterization of Caulobacters Isolated from 
Wastewater Treatment Systems" Applied and Environmental Microbiology 57:751- 
758). Furthermore, antisera raised against the S-layer protein of CB15 reacts against 
the S-layer protein of other Caulobacter (see: Walker, S.G. et al. (1992) [supra]). All 

10 S-layer proteins isolated from Caulobacter may be substantially purified using the same 
methods. All strains appear to have a polysaccharide species which may be required 
for S-layer attachment (see: Walker, S.G. et al. (1992) [supra]). 

The S- layers elaborated by freshwater isolates of Caulobacter are visibly 
indistinguishable from the S-layer produced by Caulobacter strains CB2 and CB15. 

15 The S-layer proteins from the latter strains have approximately 100,000 m.w. although 
sizes of S-layer proteins from other species and strains will var\\ The hydrophillic S- 
layer protein has been characterized both structurally and chemically. It is composed of 
ring-like structures spaced at 22 nm intervals arranged in a hexagonal manner on the 
outer membrane The S-layer is bound to the bacterial surface and may be removed by 

2 0 low pH treatment or by treatment with a calcium chelator such as EDTA. 

The similarity of S-layer proteins in different strains of Caulobacter permits the 
use of a cloned S-layer protein gene of one Caulobacter strain for retrieval of the 
corresponding gene in other Caulobacter strains (see: Walker, S.G. et al. (1992) 
[supra]: and MacRae, J.D. et al. (1991) [supra]), 

2 5 Expression of a heterologous polypeptide as a fusion product with the S-layer 

protein of Caulobacter provides advantages not previously seen in systems for 
production of recombinant fusion proteins using other organisms such as E. coli and 
Salmonella . All known Caulobacter strains are believed to be harmless and are nearly 
ubiquitous in aquatic environments. In contrast, many Salmonella and E. coli su-ains 

3 0 are pathogens. Consequently, expression and secretion of a heterologous polypeptide 

using Caulobacter as a vehicle has the advantage that the expression system will be 
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Stable in a variety of outdoor environments and may not present problems associated 
with the use of a pathogenic organism. Furthermore, Caulobacter are natural biofilm 
forming species and may be adapted for use in fixed biofilm bioreators. The quantity of 
S-layer protein that is synthesized and is secreted by Caulobacter is high, reaching 12% 
5 of the cell protein. 

There is an existing need to produce pure proteins and peptides in an economical 
manner and in a manner that minimizes or simplifies the purification steps needed after 
fermentation. Key commercial areas include the production of recombinant human and 
animal therapeutic antibiotic and vaccine peptides, industrial enzymes, protein 
10 polymers, and antibacterial enzymes for foodstuffs. Many of these commercial 
applications require low production costs and there are few expression systems available 
that can meet such cost restraints. In addition, there are numerous research applications 
where rapid methods to produce and purify proteins are needed to facilitate the 
discover)^ stage. This is especially true where there is a desire to express a large 
15 number of proteins with unknov^'n function (from a collections of cloned cDNA's, for 
example) or a large number of variants of a single protein, (for example, resulting from 
site directed mutagenesis) in a search for variants with improved properties. 

Generalh , proteins must be secreted to be produced at low cost. The primary 
reason is the much reduced cost of purification of the target protein from cell material. 
2 0 However, even for secreted proteins, simple methods of separating the product from 
spent culture and cells are important for cost reduction and ease of use. 

An international patent application published as WO 97/34000 on September 18. 
1997 describes the expression and secretion of recombinant proteins from Caulobacter 
in which the recombinant protein is a fusion of all or part of Caulobacter S-layer protein 

2 5 with a heterologous protein of interest (also see: Single, W.H., et al. 1997' "Linker 

Mutagenesis of the Caulobacter us S-layer protein: Toward a Definition of an N- 
lerminal Anchoring Region and a C-terminal Secretion Signal and the Potential for 
Heterologous Protein Secretion", J. Bacteriol. 179:601-611). 

The Caulobacter S-layer secretion apparatus is in the category of "Type 1" 

3 0 secretion ustially found in pathogenic bacteria and noted for its ability to secrete a wide 

variety of proteins including large and hydrophillic proteins. The Caulobacter protein 
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secretion system is panicularly useful to secrete recombinant proteins. 

Tne Cauiobacter 3-layer Type 1 secretion pathway requires only a C-terminal 
secretion signal, ty^pically comprising about 200 amino acids at the end of the protein. 
The export mechanism is capable of tolerating a wide variety of foreign proteins. 
5 Recombinant proteins may be conveniently produced as fusion proteins with the target 
protein being fused to the C-terminal secretion signal. Depending on the application, it 
may be desirable to remove the secretion signal following secretion. Not removing the 
secretion signal may be an approach suitable for many subunit vaccine applications, 
where the remaining S-layer protein serves as a carrier. 

10 A unique and desirable feature of fusion proteins produced by the Cauiobacter 

S-layer protein secretion system is that they form insoluble aggregates in the culture 
medium. This is apparently a consequence of the S-layer sequences associated with 
secretion signal and reflects the fact that the protein normally self-assembles into a two 
dimensional crystalline layer on the bacterium's surface. These aggregates are visible 

15 10 the naked eye and are readily collected by simple filtration. With simple water wash 
steps, residual bacterial cells are readily flushed away. It is routinely possible to 
achieve a protein puriw of 90% or bener with this simple purification procedure. 



DESCRIPTION OF THE PRIOR ART 



20 



Most current protein purification systems for recombinant proteins produced by 
bacteria rely upon an affmit\' matrix to achieve separation of the target protein and to 
concentrate the protein for subsequent steps of purification. To accomplish this, genes 
for recombinant proteins are commonly constructed so that they contain affinit\' tags, 
2 5 which are protein sequences that will bind to an affinity matrix. CommorUy used 
systems include the following: 

(a) glutathione S-transferase (GST) ug. which binds to glutathione-sepharose 
matrices: 



30 



(b) maltose binding protein (MBP) tag, which binds to amylose matrices; 
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(c) multiple tandem hisiidine residues (e.g. "His-6'') tag. wtnch binds to 

nickel-derivatized solid matrices: and 

5 (d) protein A tag, which binds to Immunoglobulin IgG-cten\"ai2asd sepharose or 
comparable matrices. 

Prior an techniques were typically developed so that remov-ai :f a target protein 
does not disrupt the tag and matrix association. Instead, enz>Tnes oar cleave specific 
10 sequences of amino acids are employed. The enzyme cleavage ssxesssx is positioned 
between the tag and the desired recombinant protein and enzyinaDc cieav^e is effected 
directly on the matrix with attached fusion protein. If a secretion ugml is used, the 
cleavage site is usually positioned such that the secretion signal li separated from the 
target recombinant protein during the cleavage step. The matrix is regoieraied for re- 
15 use only after the target recombinant protein has been purified awa^ fnxn the matrix. 
Typical enzymes used in these methods are Factor Xa, enterokinase hki coUagenase. 

Chemical cleavage is generally not used because the coocnc«ss required for 
cleavage will disrupt the billing of affmity tag and matrix or desiro> ±e matrix. When 
chemical cleavage is used with recombinant fusion proteins to cleave lirget protein from 
2 0 a secretion sigtial and/or affmity tag, solubilization and denamrnioQ processes are 
generally employed. The expectation is that complete or nearly ccoplete unfolding of 
the protein is a prerequisite for effective cleavage. 

Mild-acid cleavage is predicated on the inclusion, by happecrcance or cfesign, of 
the acid-sensitive aspartate-proline dipeptide at a desired site for clervage. The protein 

2 5 to be cleaved is typically exposed to conditions that solubQize and'or completely 

denature the protein prior to cleavage. The chaou-opic agent giianrime hydrochloride 
(used at 6-7 M) is commonly employed to denamre and solubilize lie protein prior to. 
or at the same time as acid treatment. Alternately, high concentratkns of acids that also 
serve as solubilizing agents (as examples: 70-90% formic acid, acetic ^nd [10%] 

3 0 pyridine, or relatively high concentrations of HCL (60 mM or nKre) are employed. 

Because such conditions would disrupt a lag/affmity matrix associaooii. direa cleavage 
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of an affinity lag from the target protein while a protein remains associated with an 
affinity matrix is not attempted. 

General conditions for cleavage at aspartate - proline sites are described in 
5 Current Protocols in Molecular Biology (supp. 28; chapter 16.4) John Wiley & Sons 
Inc. 1994, and in Landon, M. "Cleavage at Aspartyl - Prolyl Bonds" in Methods in 
Enzymology (1977) 47: 145-149. These references suggest that significant variability 
of cleavage conditions exist for different proteins and that cleavage might occur in some 
instances without first denaturing or solubilizing the protein. However, in practice, the 
10 latter circumstances are rare sxid proteins to be subjected to acid cleavage at Asp- Pro 
dipeptides are usually solubilized to a state where there is no visible mrbidii>'. Such 
solubilized protein will normally not pellet when centrifuged at 100,000 x g for 1 hour. 
It is now shown that mild-acid conditions may be used for cleavage of aspartate -prolme 
sites in Caulobacter S-layer fusion proteins without placing the protein in a solubilized 
1 E state as described above. 

SUMMARY OF INVENTION 

This invention is based on the unexpected discovery that recombinant fusion 
2 0 proteins produced by the Caulobacter S-layer protein secretion system can be cleaved 
under mild-acid conditions and solubilization of the fusion protein is not required. 
Cleavage may be accomplished while the fusion protein is in the form of an insoluble 
aggregate t>'pical of the Caulobacter S-layer protein. Cleavage occurs at aspartate- 
protein dipeptides which may be in a heterologous protein ponion of the fusion protein 

2 5 or in a ponion that is native to the Caulobacter S-layer portion. The dipeptide mav be 

placed at a desired location for cleavage by engineering DNA encoding the fusion 
protein to express the dipeptide at the desired location. A preferable location for 
cleavage may be at or near the junction between a heterologous (target) protein and the 
Caulobacter S-layer portion comprising the Caulobacter secretion signal, such that a 

3 0 cleavage product will be the target protein in its entirety and substantially free of 

extraneous amino acids. 
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The current invention makes it possible to cleave a heterologous (target) protein 
from the S-layer protein portion using only mild-acid conditions, even while the fusion 
protein is in an aggregated form. These cleavage conditions do not result in significant 
solubilization of the S-layer protein ponion. 

5 

This invention provides a method of cleaving a fusion protein including a first 
component which comprises all or pan of a Caulobacter S-layer protein including a 
Caulobacter C-terminal secretion signal, and a second component heterologous to 
Caulobacter. The fusion protein contains at least one aspartate -proline dipeptide. The 

10 method comprises combining the fusion protein with an acid solution of a strength 
insufficient to solubiltze the fusion protein for a time sufficient for cleavage of the 
fusion protein at the aspartate -proline dipeptide. The acid solution may have a pH of 
from about L5 (eg. 1.5 ± 0.1) to about 2.5 (eg. 2.5 ± 0.1), and preferably from about 
1.65 (eg. 1.65 X 0.05) to about 2.35 (eg. 2.35 ± 0.05). Preferred pH conditions may 

15 be achieved using an acid equivalent in the range of about 5 to about 20 mM HCL. 
The method is typically carried out at a temperamre in the range of approximately room 
temperature to about SCC. 

This invention also provides a method of preparing a DNA construct suitable for 
expression of a fusion protein suitable for use in the method of this invention. The 

2 0 method comprises joining an upstream DNA segment including DNA heterologous to 
Caulobacter which includes a protein of interest to a downstream DNA segment 
including DNA for a Caulobacter C-terminal secretion signal which does not eiKOde an 
aspartate -proline dipeptide. The upstream segment contains DNA encoding an 
aspartate-proline dipeptide at or near the junction between said upstream and 

2 5 downstream segments . 

This invention also provides a method of preparing a fusion protein, comprising 
the steps of expressing a DNA construct as described above in Caulobacter and 
recovering said fusion protein once secreted by the Caulobacter. 

Once cleavage is accomplished according to this invention, the S-layer portion 

3 0 comprising the Caulobacter secretion signal may remain as an insoluble aggregate. If 

the target protein is soluble, the S-layer ponion may be easily separated from the target 
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recombinant protein by simple centrifiigation or filtration methods. Thus the system of 
this invention facilitates separation as would a Tag/affinity matrix system except that 
here, the system is also the means for producing an insoluble matrix. In addition, the 
insoluble matrix produced by this invention is resistant to the effects of the acid 
treatment, allowing direct cleavage of the target recombinant protein. In this way, a 
ver>' inexpensive chemical cleavage method can be employed to economically retrieve 
recombinant proteins from a bacterial fusion protein. In contrast to the cost of most 
affuiity matrices, there is linle expense associated with the use of the S-layer secretion 
signal as it is simply a pan of the fermentation'^ secretion process. 
0 

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

Production of Recombinant Fusion Proteins Using 
the Caulobacter S-Iayer Secretion System 

5 

Proteins may be produced using the Caulobacter S-layer Type 1 secretion 
pathway which requires only the C-terminal secretion signal of the Caulobacter . This 
signal is the C-terminal portion of the S-layer protein, which typically comprises about 
200 amino acids. (See: Single, et al. (1997) [supra]] and, WO 97/34000). Additional 
0 Caulobacter S-layer DNA upsu*eam from the secretion signal may also be present and 
may be desirable to encode portions of the S-layer protein which will conu-ibute to 
aggregate formation of the secreted protein. Such additional Caulobacter DNA may 
constitute most or all of the remaiiKier of the DNA encoding the S-layer protein. 

Standard techniques (such as methods described in WO 97/34000) may be used 
5 to identify' the amount of the C-terminal ponion of a particular Caulobacter S-layer 
protein which functions as the secretion signal. 

Creation of fusion proteins is commonly done by preparing DNA which codes 
for the target protein and fusing it in-frame with the C-terminal region of the S-layer 
gene. There are numerous possible methods, with the following being examples. 
0 I- Oligonucleotide Chemical Synthesis. This involves the design of 
complementar\' single strands, complete with desirable restriction endonuclease cut sites 
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at the ends, chemical synthesis of the strands followed by annealing, cloning into a 
plasmid vector, juxtaposed to an appropriate portion of the C-terminal region of the S- 

layer gene. 

2. Production of the Target Gene DNA by Polymerase Chain Reaction (PCR) 
5 Amplification of a Target Sequence. In this case, appropriate in-firame restriction 

sites are incorporated into the shon oligonucleotides used for amplification of a target 
sequence, such that the final PCR product can be treated with the appropriate restriction 
enzymes (to create the restriction site "sticky ends"), followed by cloning into a plasmid 
vector, juxtaposed to an appropriate portion of the C-terminal region of the S-layer 
10 gene. 

3. Adapting Restriction Endonuclease Cleavage Sites that are Native to a 
Target Protein Gene Sequence for Fusion to the DNA Coding for the C-terminal S- 
layer Secretion Signal to Accomplish In-frame Expression of a Chimeric Protein. 

15 This can be accomplished by direct ligation (although it is uncommon that an 
appropriate match will occur), or the use of adapter sequences or methods involving 
blunting of a resu-iction site and subsequent blunt-end ligation to change expression 
reading frame or join unlike restriction site sticky ends. 

There will be numerous convenient sites for fusion with the C-terminal regions 

2 0 of the S-layer that lead to the successfiil expression, secretion and aggregation of a 
recombinant fusion protein. Some example positions are at or near the DNA sites 
corresponding to amino acids 622, 690. 784, 892 and 907 of the C. crescenms S-layer 
gene (see: Appendix 1 and, WO 97/34000). Other sites of fusion with the S-layer gene 
may also be employed. Most often a plasmid vector is designed such that the C- 

2 5 terminal gene segment is resident on a plasmid with appropriate restriction sites placed 
at the N-ierminal junction of the S-layer fragment. Target recombinant protein gene 
segments are then cloned into those restriction sites. It is typical to prepare initial 
plasmid consimcts that are replicated in E.coli . After a construct is produced, it is 
typically transferred to a broad host range plasmid which can then be introduced into 

3C the appropriate Caulobacter strain by electroporation. Suitable broad host range 
plasmids can be constructed from (but are not limited to) the IncQ, IncW and IncPl 
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plasmid incompatibility groups. 

The introduction of the aspartate-proline (Asp-Pro) dipeptide at the appropriate 

site in the fusion protein can be done in several ways. Some examples are: 

(a) incorporating a DNA sequence necessary to express the Asp-Pro 
dipeptide into the oligonucleotides used to prepare the target sequence, either by 
oligonucleotide synthesis or PCR methods; 

(b) preparing a DNA segment with appropriate restriction sites at the termini 
so that an Asp-Pro dipeptide can be introduced (most often at the junction between S- 
layer and target gene) after a fusion recombinant S-layer gene has been made; and 

(c) use of a native Asp-Pro dipeptide in either the target DNA or the S-layer 
segment (for example, an Asp-Pro dipeptide is located ai amino acids 692 and 693 of 
the C. crescenms S-layer gene and is suitable for fusions made at the amino acid site). 

The methods described above are not the only methods that may be used for 
creating and expressing fusion recombinant S-layer proteins, nor is it necessar\' to have 
the engineered genes resident on a plasmid. For example, the expressed gene may be 
introduced into the chromosome (using well-known gene insertion or replacement 
techniques) and still achieve secretion of the recombinant proteins (see WO 97/34000). 
2 0 In some cases it may be desirable to produce recombinant fusion proteins as insertions 
of heterologous DNA in the middle of the S-layer gene. In such a case, Asp-Pro 
dipeptide sequences could be engineered at the N and C-termini of the target pepude. 

All possible codon combinations for Asp-Pro will work but the CCA codon for 
proline is not preferred due to the likelihood of a low amount of the corresponding 
2 5 tRNA being present in Caulobacter . The following is an approximate usage table for 
C. crescentus. 
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Caulobacter crescentus Codor Usa^s 
(Amino Acid] [TnpJel Code] [FrecxeTcr.. ^ T>xx*sand] 

Phe UUU 2.5 Ser jCX t -2 Try UAU 6,6 Cys UGU 0 6 

PheUUC 27.0 Se-^CC 2-c Try UAC 9.6 Cyi UGC 5 5 

LeuUUA 0.0 Ser CA StOP UAA 0.8 Cys UGA 16 

LeuUUG 4.4 Ser UCG 25.7 STOP UAG 0.6 STOPUGG 712 

LeuCUU 4.4 Pn:CC^ 2-5 His CAU 3.2 Ara CGU 7 6 

Leu cue 15.7 His CAC 12.2 Arg CGC 44.7 

LeuCUA 11 £IE GInCAA 3.7 A^ CGA 3.0 

LeuCUG 72.3 Pm ZCC- r?.i Gin CAG 30.2 Arg CGG 12.1 

HeAUU 2.4 Tn: acx^ '2 Asn AAU 4.1 Ser AGU 0 8 

IteAUC 49.0 TtrACC 57^ Asn AAC 23.8 SerA^ 14 9 



lleAUA 0.3 lyrXCA tl£ Lys AAA 2.7 Arg AGA 0 4^ 

Met AUG 25.7 Thr iCG VBJ^ LysAAG 37.9 Arg AGG ^ ^ 

ValGUU 5.4 Ate 30^ 9 = Asp GAU 11.1 Gly GGU 9.5 

ValGUC 42.7 A;o 3CC W.i Asp GAC 48.5 G»y GGC 64 8 

ValGUA 1.0 A:^30 GluGAA 20.5 Gly GGA 2.3 

Vai GUG 30.7 Ais 3CG 36.7 Glu GAG 45.4 Giy GGG 7 7 
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Large quaniiiicb (eg. 12% of total cell proteiii/3% of input organic carbon) of a 
wide range of proteins can be produced, with yields in the order of 250 mg/liter of 
batch culture. Fusion proteins with 35 kDa of target peptide are secreted with little 
5 difficult} , although proteins with multiple cysteines may be more difficult to express. 
Post-expression glycosylation of proteins does not occur, an advantage for most peptide 
expression applications. 

: Host Expression Strains 

For secretion of recombinant fusion S-layer proteins, the Caulobacter strain will 
preferably be one which has lost the ability* to produce a native S-layer protein, while 
retaining a fully functional S-layer protein secretion apparatus. Such strains may be 

15 obiamed by screening for mutants that have spontaneously become S-layer protein 
negative: or, by directed genetic manipulation, such as (but not limited to) the insertion 
of a drug resistance cassette in the middle of the S-layer gene or the subsiimtion of a 
version of the S-layer gene which has had a sizeable internal region deleted from the 
gene (see: Bingle et al. 1997' [supra]: Single ec al. 1997- "Cell Surface Display of a 

^-^ Pseudonomonas aerugenosa PAK Pilin Peptide with the Paracrystalline Layer of 
Caulobacter crescenms " Molec. Microbiol. 26:277-288; and, Edwards and Smit (1991) 
A Transducing Bacteriophage for Caulobacter us Uses the Paracrystalline Surface 
Layer Protein as a Receptor'^ J. Bacteriol, 173: 5568-5572). In the case of a genetic 
manipulation, a common method for producing such strains is to modify a copy of the 

2 5 S-layer gene while on a plasmid and then to use well known gene replacement methods 

to substitute the modified gene for the native gene in the Caulobacter chromosome (see: 
Edwards and Smit (1991) [supra]). 

If an entire S-layer gene is to be used for production of a recombinant protein 
(via insertion of a target sequence), strains defective in the production of the 

3 0 lipopolysacharide (LPS) used for S-layer attachment to the bacterial surface can be 

used. These can be prepared by forcing Caulobacter to grow without exogenous 
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calcium. Under these conditions mutants arise that are uniformly defective in 
producing a proficient version of the S layer LPS (see; Walker, S.G. e:t aL (1994) 
"Characteristics of Mutants of Caulobacter crescentus Defective in Surface Attachment 
of the Paracrystaline Layer" J. Bacteriol. 176: 6312-6323). 
5 All Caulobacter S- layer producing strains are suitable for this technology. One 

may isolate the S-layer gene from a particular strain (using homology between 
Caulobacter S-layers to design probes to detect and clone the S-layer gei^) and adapt 
the C-terminal region for recombinant protein expression, in a manner similar to that 
done for C crescentus strains (see: MacRae and Smit (1991) [supra], and Walker, S.G. 
10 et al. (1992) [supra]). Alternatively, one may construct recombinant fusion S-layer 
genes using the C. crescenms S-layer gene and express the recombinant genes in 
alternate Caulobacter hosts. 

Freshwater Caulobacter producing S-layers may be readily detected by negative 
stain transmission electron microscopy techniques. Caulobacter may be isolated using 
15 the methods outlined by MacRae and Smit (1991) [supra], which take advantage of the 
fact that Caulobacter can tolerate periods of starvation while other soil and water 
bacteria may not and that they all produce a distinctive stalk structure, visible by light 
microscopy (using either phase contrast or standard dye staining methods). Once 
Caulobacter strains are isolated in a typical procedure, colonies may be suspended in 
2 0 2% ammonium molybdaie negative stain and applied to plastic-filmed, carbon-stabilized 
3(X3 or 400 mesh copper or nickel grids and examined in a transmission electron 
microscope at 60 kilovolt accelerating voltage (see: Smit, J. (1986) "Protein Surface 
Layers of Bacteria", in Outer Membranes as Model Systems , (M. Inouge, ed. J.Wiley 
& Sons, at p. 343-376). S-layers are seen as t\^'o-dimensional geometric patterns most 

2 5 readily on those cells in a colony that have lysed and released their internal contents. 

Recombinant Protein Purification 

Secreted proteins are separated and shed into the culture media as a macroscopic 

3 0 precipitate (the ''aggregate" referred to herein). The shedding phenomenon is a 

consequence of the absence of the N -terminal region of the S-layer protein in the 
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expressed recombinant protein, or the loss of the lipopolysaccharide species used for S- 
layer attachment by the Caulobacter (see: Walker, S.G. et al. (1994) [supra]). 
Typically, the aggregate forms as loose, gel-like lumps of pure protein that can readily 
be retrieved and separated from the bacteria by simple filtration. 
5 The aggregate may be readily separated from a soluble cleaved target protein by 

any suitable techniques such as filtration of centrifugation. If the target protein is 
insoluble once cleaved, it may then be convenient to then solubilize one or both of the 
proteins (for example in 8M urea or 6M quanidine HCL) and separate by 
chromatography. In this way, only 2 species of protein need to be separated. 

Cleavage of Fusion Proteins 

General procedures for performing mild-acid cleavage are known from in the 
prior art as described above. In the method of this invention, conditions are adjusted to 
15 avoid destruction of the target protein or solubilization of the aggregate containins the 
S-layer secretion signal. Excess acid or too high a temperature may increase the 
occurrence over time of random cleavages along the length of the fusion protein, which 
is to be avoided since such random cleavages may lead to undersized fragmentation of 
the fusion protein or solubilization of the aggregated S-layer portion. 

20 

Good yields of target protein with minimum random breaks in the fusion protein 
may generally be achieved by using from 5-20 mM HCL (or its equivalent while 
employing another acid). The respective pH of these conditions (unbuffered acid 
solution) is from about 2.3 to about 1.7. Time and temperaaire is preferably adjusted 

2 5 by routine monitoring to achieve the desired cleavage while minimizing random breaks. 

For example, temperature may range from room temperamre to about 50** C. Time of 
treatment may range from about 12 to about 72 hours. Time or temperature outside of 
these ranges is permissible depending upon the su-ength of the acid and the accepted 
yield. General!} . lower yields are obtained with less acid strength, less time or lower 

3 0 temperatures. 

In the following examples, efficiency of cleavage in the order of 40-80% is 
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achieved using conditions the same as or similar to the following alternatives: 

- 5 mM HCL at 5(f' C. for 48-72 hours 

- 20 mM HCL at 30^ C. for 48-72 hours. 

Conditions in excess of the aforementioned values may be employed in some 
5 cases with the possibility of random breaks increasing, panicularly with increased acid 
strength or temperature. In the following examples, significant random cleavage 
occurred with 50 mM HCL at 50° C. after 48 hours. 

.\ny acid may be employed in this invention which is normally used in solutions 
CO which proteins are exposed. Acids which have a deleterious effect on proteins under 
10 dilute conditions should be avoided. For example, HCL or an equivalent amount of 
H.SOa may be used in this invention but oxidizing acids such as nitric acid may not be 
suitable. 

Example 1 > Cleavage of artificial silk protein sequences 
1 E from a secretion signal containing a native aspartate-proline cleavage site. 

.\n anificial protein sequence resembling spider silk was constructed by 
synthesis of partially overlapping and complementing oligomers of DNA, which were 
then completed to a full duplex DNA with Taql polymerase extension, to create a 
2 0 sequence that coded for 97 amino acids. The resulting DNA sequence and 
corresponding amino acid sequence are shown in Appendix 2. 

The DNA sequence shown in Appendix 2 was cloned into a gene carrier 
sequence residing in a pUC8 plasmid cloning vector. The gene segment carrier had 
BamHI restriction sites at each end and an internal Bglll site. This combination of 

2 5 restrictions sites allowed the production of multimers of the above sequence, relying on 

the fact that BamHI stick)* ends will ligate into Bgin sticky end, with the loss of both 
restriction sites. Thus one copy of the silk-like sequence within the gene segment 
carrier can be put inside a second copy of the same to produce a dimer. Using this 
principle, an 8X repeat was produced, fused to DNA encoding the S-layer secretion 

3 0 signal corresponding to the C-terminal portion of the C. crescenms S-layer protein from 

about amino acid 690 onwards (see: Appendix 1). This fusion protein gene was 
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iniroduced into strain CB2A on a broad host range plasmid vector. The 8x mukixner 
appeared to be unstable, resuuing in reccir.bination events that reduced the 8X multimer 
to a 3x size. The 3 fold repeat of the above 97 amino acid sequence, fused to the S- 
layer secretion signal was secreted. Protein was collected and subjected to treatment 
5 with 5mM HCL for 2 days at 5(f C. The result was the liberation of about 80% of 
soluble silk-like polymer which was readily separated by filtration from the S layer 
protein which remained completely aggregated under these conditions. Cleavage 
occurred at native aspartate-proline dimer in the Caulobacter S-layer signal region (see: 
Appendix L amino acids numbered 692-693). 

10 

Example 2. Cleavage of the salmonid virus Infectious Pancreatic Necrosis 
Virus (IPNAO surface glycoprotein candidate vaccine sequence from an 
S-layer secretion signal containing a native aspartate-proline site. 

15 

The surface glycoprotein of the IPNV strain is a vaccine candidate. For this 
example and Example 4, the sequence of the first 257 amino acids of the mature protein 
and the corresponding DNA sequence as shown in Appendix 3 were used. 

DNA encoding a segment of the major surface glycoprotein gene of IPNV 
2 0 specifying amino acids 145-257 of the protein was fused to DNA sequence specifying 
two putative T-cell activating epitopes: MVP (SEQ ID No:l; I^EKGVIVHRLEGV, 
derived from Measles Virus protein F) and P2 (SEQ ID No:2; QYIKANSKFIGITEL, 
derived from tetanus toxoid protein). The T-cell epitopes were positioned on the C- 
terminal end of the IPNV sequence. This chimeric protein was in turn fused in frame 

2 5 with the C-crescentus S-layer gene at about amino acid 690 position of the gene and 

introduced into Caulobacter on a broad host range plasmid vector. The resulting 
secreted protein was collected and treated with 5 mM HCL for 2 days at 50*^ C. 
Cleavage occurred at the native aspartate-proline dimer described in Example 1 . The 
result was the liberation of about 75% of soluble vaccine candidate chimeric protein 

3 0 from the S-layer secretion signal which remained aggregated. 
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Example 3. Cleavage of segments of an E. arfi type ! pilus tin subunit fircMn 
an S-layer secretion signal crataining a native aspartate-proline cleavage site, 

5 The FimH gene product is the tip pilus subunit of the E. coli strains involved 

with urinary tract infections. Two segments, T3 (specifying the first 145 amino acids 
of the mature peptide) and T7 (specifying the entire 258 amino acids of the mature 
peptide) were fused to the S-layer secretion signal at abCHJt amino acid 690 of the 
S-layer sequence. The T3 and T7 sequences are shown in Appendix 4. 
0 The fusion protein genes were introduced into strain CB2A on a broad host 

range plasmid vector. In both cases the resulting secreted protein was collected and 
U-eated with 5 mM HCL for 2 days at 50° C. In both cases, the result was the liberation 
of about 50% of soluble vaccine candidate chimeric protein from the S-layer secretion 
signal which remained aggregated. Cleavage occurred at the native aspartate-proline 

1 5 dimer described in Example 1 . 

Example 4 . Cleavage of the salmonid virus IPNV surface glycoprotdn 
candidate vaccine sequence from an S-layer secretion signal containing 
an introduced aspartate-proline cleavage site, 

2 0 

A segment of the major surface glycoprotein gene of IPNV specifying amino 
acids 1-257 of the protein shown in Appendix 4 was fiised to a DNA sequence 
specifying a peptide containing an aspartate-proline dipeptide (SEQ ID No: 3; 
SPLGPAGDPEAS) such that the aspartate-proline dipeptide was positioned very near 

2 5 the C-terminus of the chimeric protein. This chimeric protein was in mm fused in 

frame with the C. crescentus S-layer gene at about amino acid 784 position of the gene 
and introduced in strain CB2A on a broad host range plasmid vector. The resulting 
secreted protein was collected and treated with 5 mM HCL for 2 days at 50° C. 
Cleavage occurred at the introduced aspartate-proline dipeptide. The result was the 

3 0 liberation of about 40% of insoluble vaccine candidate chimeric protein from the S- 

layer secretion signal which remained aggregated. 

Lx>nger DNA and amino acid sequences referred to above are set out in the 



• 
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following Appendices which are part of this description. Appendix 1 sets out the 
complete nucleotide sequence of the C. crescentus S-layer gene (SEQ ID No: 4) with 
the upstream sequence including the -35 and -10 sites of the promoter region and the 
Shine Dalgamo sequence. The start codon is at nucleotide 101 and the coding sequence 

5 run to and includes nucleotide 3179. The amino acid sequence of the C. crescentus S- 
layer protein (SEQ ID No: 5) included in Appendix 1 is predicted from the DNA 
sequence. Appendix 2 sets out the artificial spider silk DNA sequence (SEQ ID No:6) 
used in Example 1 and the corresponding amino acid sequence (SEQ ID No. 7). 
Appendix 3 sets out the DNA sequence (SEQ ID No: 8) and corresponding amino acid 

0 sequence (SEQ ID No: 9) of the first 257 amino acids of IPNV as described in 
Examples 2 and 4. Appendix 4 sets out the T3 protein sequence (SEQ ID No: 10) and 
the T7 protein sequence (SEQ ID No: 11) as described in Example 3. 

All publications, patents and patent applications referred to herein are hereby 
incorporated b> reference. While this invention has been described according to 

5 particular embodiments and by reference to certain examples, it will be apparent to 
those of skill in the an that variations and modifications of the invention as described 
herein fall within the spirit and scope of the attached claims. 



